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TITLE OF INVENTION 

HIGH MOLECULAR WEIGHT SURFACE PROTEINS 
OF NON-TYPEABLE HAEMOPHILUS 

FIELD OF INVENTION 
This invention relates to high molecular weight 
proteins of non-typeable haemophilus. 

BACKGROUND TO THE INVENTION 
Non-typeable Haemophilus influenzae are non- 
encapsulated organisms that are defined by their lack of 
reactivity with antisera against known H. influenzae 
capsular antigens . 

These organisms commonly inhabit the upper 
respiratory tract of humans and are frequently 
responsible for infections, such as otitis media , 
sinusitis, conjunctivitis, bronchitis and pneumonia. 
Since these organisms do not have a polysaccharide 
capsule, they are not controlled by the present 
Haemophilus influenzae type b (Hib) vaccines, which are 
directed towards Hib bacterial capsular polysaccharides • 
The non-typeable strains, however, do produce surface 
antigens that can elicit bactericidal antibodies. Two of 
the major outer membrane proteins, P2 and P6, have been 
identified as targets of human serum bactericidal 
activity. However, it has been shown that the P2 protein 
sequence is variable, in particular in the non-typeable 
Haemophilus strains. Thus, a P2 -based vaccine would not 
protect against all strains of the organism. 

There have previously been identified by Barenkamp 
et al ( Pediatr. Infect. Pis. J. . 9:333-339, 1990) a group 
of high-molecular-weight (HMW) proteins that appeared to 
be major targets of antibodies present in human 
convalescent sera. Examination of a series of middle ear 
isolates revealed the presence of one or two such 
proteins in most strains. However, prior to the present 
invention, the structures of these proteins were unknown 
as were pure isolates of such proteins. 
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SUMMARY OF INVENTION 
The inventors, in an effort to further characterize 
the high molecular weight (HMW) Haemophilus proteins, 
have cloned, expressed and sequenced the genes coding for 
5 two immunodominant HMW proteins (designated HMW1 and 
HMW2) from a prototype non-typeable Haemophilus strain 
and have cloned, expressed and almost completely 
sequenced the genes coding for two additional 
immunodominant HMW proteins (designated HMW3 and HMW4) 
10 from another non-typeable Haemophilus strain. 

In accordance with one aspect of the present 
invention, therefore, there is provided an isolated and 
purified gene coding for a high molecular weight protein 
of a non-typeable Haemophilus strain, particularly a gene 
15 coding for protein HMW1, HMW2 , HMW3 or HMW4 , as well as 

any variant or fragment of such protein which retains the 
immunological ability to protect against disease caused 
by a non-typeable Haemophilus strain. In another aspect, 
the invention provides a high molecular weight protein of 
20 non-typeable Haemophilus influenzae which is encoded by 
these genes. 

BRIEF DESCRIPTION OF DRAWINGS 
Figure 1 is a DNA sequence of a gene coding for 
protein HMW1 (SEQ ID NO: 1) ; 
25 Figure 2 is a derived amino acid sequence of protein 

HMW1 (SEQ ID NO: 2) ; 

Figure 3 is a DNA sequence of a gene coding for 
protein HMW2 (SEQ ID NO: 3) ; 

Figure 4 is a derived amino acid sequence of HMW2 
30 (SEQ ID NO: 4) ; 

Figure 5A shows restriction maps of representative 
recombinant phages which contained the HMW1 or HMW2 
structural genes, the locations of the structural genes 
being indicated by the shaded bars; 
3 5 Figure 5B shows the restriction map of the T7 

expression vector pT7-7; 
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Figure 6 contains the DNA sequence of a gene cluster 
for the hmwl gene (SEQ ID NO: 5) , comprising nucleotides 
351 to 4958 (ORF a) (as in Figure 1) , as well as two 
additional downstream genes in the 3' flanking region, 
5 comprising ORFs b, nucleotides 5114-6748 and c 
nucleotides 7062-9011 ; 

Figure 7 contains the DNA sequence of a gene cluster 
for the hmw2 gene (SEQ ID NO: 6) , comprising nucleotides 
792 to 5222 (ORF a) (as in Figure 3) , as well as two 
10 additional downstream genes in the 3' flanking region, 
comprising ORFs b, nucleotides 5375-7009, and c, 
nucleotides 7249-9198 ; 

Figure 8 is a partial DNA sequence of a gene coding 
for protein HMW3 (SEQ ID NO: 7) ; 
15 Figure 9 is a partial DNA sequence of a gene coding 

for protein HMW4 (SEQ ID NO: 8); and 

Figure 10 is a comparison table for the derived 
amino acid sequence for proteins HMW1, HMW2, HMW3 and 
HMW4. 

2 0 GENERAL DESCRIPTION OF INVENTION 

The DNA sequences of the genes coding for HMW1 and 
HMW2, shown in Figures 1 and 3 respectively, were shown 
to be about 80% identical, with the first 1259 base pairs 
of the genes being identical. The derived amino acid 

2 5 sequences of the two HMW proteins, shown in Figures 2 and 

4 respectively, are about 70% identical. Furthermore, 
the encoded proteins are antigenically related to the 
filamentous hemagglutinin surface protein of Bordetella 
pertussis . A monoclonal antibody prepared against 

3 0 filamentous hemagglutinin (FHA) of Bordetella pertussis 

was found to recognize both of the high molecular weight 
proteins. This data suggests that the HMW and FHA 
proteins may serve similar biological functions. The 
derived amino acid sequences of the HMW1 and HMW2 
3 5 proteins show sequence similarity to that for the FHA 
protein. It has further been shown that these 
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antigenically-related proteins are produced by the 
majority of the non-typeable strains of Haemophilus. 
Antisera raised against the protein expressed by the HMW1 
gene recognizes both the HMW2 protein and the 
pertussis FHA. The present invention includes an 
isolated and purified high molecular weight protein of 
non-typeable haemophilus which is antigenically related 
to the B. pertussis FHA , which may be obtained from 
natural sources or produced recombinant ly. 

A phage genomic library of a known strain of 
non-typeable Haemophilus was prepared by standard methods 
and the library was screened for clones expressing high 
molecular weight proteins, using a high titre antiserum 
against HMW's. A number of strongly reactive DNA clones 
15 were plaque-purified and sub-cloned into a T7 expression 
plasmid. It was found that they all expressed either one 
or the other of the two high-molecular-weight proteins 
designated HMW1 and HMW2 , with apparent molecular weights 
of 125 and 120 kDa, respectively, encoded by open reading 
20 frames of 4.6 kb and 4.4 kb, respectively. 

Representative clones expressing either HMW1 or HMW2 
were further characterized and the genes isolated, 
purified and sequenced. The DNA sequence of HMW1 is 
shown in Figure 1 and the corresponding derived amino 
25 acid sequence in Figure 2. Similarly, the DNA sequence of 

HMW2 is shown in Figure 3 and the corresponding derived 
amino acid sequence in Figure 4 . Partial purification of 
the isolated proteins and N-terminal sequence analysis 
indicated that the expressed proteins are truncated since 
their sequence starts at residue number 442 of both full 
length HMW1 and HMW2 gene products. 

Subcloning studies with respect to the hmwl and hmw2 
genes indicated that correct processing of the HMW 
proteins required the products of additional downstream 
3 5 genes. It has been found that both the hmwl and hmw2 
g enes are flanked by two additional downstream open 
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reading frames (ORFs) , designated b and c, respectively, 
(see Figures 6 and 7) . 

The b ORFs are 1635 bp in length, extending from 
nucleotides 5114 to 6748 in the case of hmwl and 
5 nucleotides 5375 to 7009 in the case of hmw2, with their 
derived amino acid sequences 99% identical. The derived 
amino acid sequences demonstrate similarity with the 
derived amino acid sequences of two genes which encode 
proteins required for secretion and activation of 

10 hemolysins of P. mirabilis and S. marcescens . 

The c ORFs are 1950 bp in length, extending from 
nucleotides 7 062 to 9011 in the case of hmwl and 
nucleotides 7249 to 9198 in the case of hmw2 , with their 
derived amino acid sequences 96% identical. The hmwl c 

15 ORF is preceded by a series of 9 bp direct tandem 
repeats. In plasmid subclones, interruption of the hmwl 
b or c ORF results in defective processing and secretion 
of the hmwl structural gene product. 

The two high molecular weight proteins have been 

2 0 isolated and purified and shown to be partially 
protective against otitis media in chinchillas and to 
function as adhesins. These results indicate the 
potential for use of such high molecular proteins and 
structurally-related proteins of other non-typeable 

2 5 strains of Haemophilus influenzae as components in non- 

typeable HaemoEhjJUis influenzae vaccines. 

Since the proteins provided herein are good 
cross-reactive antigens and are present in the majority 
of non-typeable Haemophilus strains, it is evident that 

3 0 these HMW proteins may become integral constituents of a 

universal Haemophilus vaccine. Indeed, these proteins 
may be used not only as protective antigens against 
otitis, sinusitis and bronchitis caused by the 
non-typeable Haemophilus strains, but also may be used as 
3 5 carriers for the protective Hib polysaccharides in a 
conjugate vaccine against meningitis. The proteins also 
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may be used as carriers for other antigens, haptens and 
polysaccharides from other organisms, so as to induce 
immunity to such antigens, haptens and polysaccharides. 
The nucleotide sequences encoding two high molecular 
5 weight proteins of a different non-typeable Haemophilus 
strain (designated HMW3 and HMW4) have been largely 
elucidated, and are presented in Figures 8 and 9. HMW3 
has an apparent molecular weight of 125 kDa while HMW4 
has an apparent molecular weight of 123 kDa- These high 
0 molecular weight proteins are antigenically related to 
the HMW1 and HMW2 proteins and to FHA. Sequence analysis 
of HMW3 is approximately 85% complete and of HMW4 95% 
complete, with short stretches at the 5 '-ends of each 
gene remaining to be sequenced. 
5 Figure 10 contains a multiple sequence comparison of 

the derived amino acid sequences for the four high 
molecular weight proteins identified herein. As may be 
seen from this comparison, stretches of identical peptide 
sequence may be found throughout the length of the 
10 comparison, with HMW3 more closely resembling HMW1 and 
HHW4 more closely resembling HMW2 . This information is 
highly suggestive of a considerable sequence homology 
between high molecular weight proteins from various non- 
typeable Haemophilus strains- 
25 in addition, mutants of non-typeable h. influenzae 

strains that are deficient in expression of HMW1 or HMW2 
or both have been constructed and examined for their 
capacity to adhere to cultured human epithelial cells. 
The hmwl and hmw2 gene clusters have been expressed in L 
3 0 coli and have been examined for in vitro adherence. The 

results of such experimentation demonstrate that both 
HMW1 and HMW2 mediate attachment and hence are adhesins 
and that this function is present even in the absence of 
other H. influenzae surface structures. 
3 5 With the isolation and purification of the high 

molecular weight proteins, the inventors are able to 
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determine the major protective epitopes by conventional 
epitope mapping and synthesize peptides corresponding to 
these determinants to be incorporated in fully synthetic 
or recombinant vaccines. Accordingly, the invention also 
5 comprises a synthetic peptide having an amino acid 

sequence corresponding to at least one protective epitope 
of a high molecular weight protein of a non-typeable 
Haemophilus influenzae . Such peptides are of varying 
length that constitute portions of the high- 
10 molecular-weight proteins, that can be used to induce 
immunity, either directly or as part of a conjugate, 
against the relative organisms and thus constitute 
vaccines for protection against the corresponding 
diseases . 

15 The present invention also provides any variant or 

fragment of the proteins that retains the potential 
immunological ability to protect against disease caused 
by non-typeable Haemophilus strains. The variants may be 
constructed by partial deletions or mutations of th 

2 0 genes and expression of the resulting modified genes to 
give the protein variations. 

EXAMPLES 

Example 1 : 

Non-typeable H. influenzae strains 5 and 12 were 

2 5 isolated in pure culture from the middle ear fluid of 

children with acute otitis media. Chromosomal DNA from 
strain 12 , providing genes encoding proteins HMW1 and 
HMW2, was prepared by preparing Sau3A partial restriction 
digests of chromosomal DNA and fractionating on sucrose 

3 0 gradients. Fractions containing DNA fragments in the 9 

to 2 0 kbp range were pooled and a library was prepared by 
ligation into XEMBL3 arms. Ligation mixtures were 
packaged in vitro and plate-amplified in a P2 lysogen of 
E. coli LE392. 

3 5 For plasmid subcloning studies, DNA from a 

representative recombinant phage was subcloned into the 
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T7 expression plasmid pT7-7 , containing the T7 RNA 
polymerase promoter <P10 f a ribosome-binding site and the 
translational start site for the T7 gene 10 protein 
upstream from a multiple cloning site (see Figure 5B) . 
5 DNA sequence analysis was performed by the dideoxy 

method and both strands of the HMW1 gene and a single 
strand of the HMW2 gene were sequenced. 

Western immunoblot analysis was performed to 
identify the recombinant proteins being produced by 
10 reactive phage clones. Phage lysates grown in LE392 
cells or plaques picked directly from a lawn of LE3 92 
cells on YT plates were solubilized in gel 
electrophoresis sample buffer prior to electrophoresis. 
Sodium dodecy 1 sulfate ( SDS ) -polyacry lamide gel 
15 electrophoresis was performed on 7.5% or 11% 
polyacry lamide modified Laemmli gels. After transfer of 
the proteins to nitrocellulose sheets, the sheets were 
probed sequentially with an E. coli -absorbed human serum 
sample containing high-titer antibody to the high- 
20 molecular-weight proteins and then with alkaline 
phosphatase-conjugated goat anti-human immunoglobulin G 
(IgG) second antibody. Sera from healthy adults contains 
high-titer antibody directed against surface-exposed 
high-molecular-weight proteins of non-typeable |L_ 
25 influenzae . One such serum sample was used as the 

screening antiserum after having been extensively 
absorbed with LE392 cells. 

To identify recombinant proteins being produced by 
E, coli trans f ormed with recombinant p lasmids , the 
3 0 plasmids of interest were used to transform E. coli BL21 

(DE3) /pLysS. The transformed strains were grown to an 
A^oq of 0.5 in L broth containing 50 jig of ampicillin per 
ml. IPTG was then added to 1 mM. One hour later, cells 
were harvested, and a sonicate of the cells was prepared. 
3 5 The protein concentrations of the samples were determined 

by the bicinchoninic acid method. Cell sonicates 
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containing 100 /ig of total protein were solubilized in 
electrophoresis sample buffer, subjected to SDS- 
polyacrylamide gel electrophoresis, and transferred to 
nitrocellulose. The nitrocellulose was then probed 
5 sequentially with the E. coli -absorbed adult serum sample 

and then with alkaline phosphatase-conjugated goat anti- 
human IgG second antibody. 

Western immunoblot analysis also was performed to 
determine whether homologous and heterologous non- 
10 typeable H. influenzae strains expressed high-molecular- 
weight proteins antigenically related to the protein 
encoded by the cloned HMW1 gene (rHMWl) . Cell sonicates 
of bacterial cells were solubilized in electrophoresis 
sample buffer, subjected to SDS-polyacrylamide gel 
15 electrophoresis, and transferred to nitrocellulose. 

Nitrocellulose was probed sequentially with polyclonal 
rabbit rHMWl antiserum and then with alkaline 
phosphatase-conjugated goat anti-rabbit IgG second 
antibody * 

20 Finally, Western immunoblot analysis was performed 

to determine whether non- typeable Haemophilus strains 
expressed proteins antigenically related to the 
filamentous hemagglutinin protein of Bordetella 
pertussis . Monoclonal antibody X3C, a murine 

25 immunoglobulin G (IgG) antibody which recognizes 

filamentous hemagglutinin, was used to probe cell 
sonicates by Western blot. An alkaline phosphatase- 
conjugated goat anti-mouse IgG second antibody was used 
for detection. 

3 0 To generate recombinant protein antiserum, E. coli 

BL21 (DE3) /pLysS was transformed with pHMWl-4 , and 
expression of recombinant protein was induced with IPTG, 
as described above. A cell sonicate of the bacterial 
cells was prepared and separated into a supernatant and 

35 pellet fraction by centrif ugation at 10,000 x g for 30 

min. The recombinant protein fractionated with the 
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pellet fraction. A rabbit was subcutaneous ly immunized 
on biweekly schedule with 1 mg of protein from the pellet 
fraction, the first dose given with Freund's complete 
adjuvant and subsequent doses with Freund's incomplete 
5 adjuvant. Following the fourth injection, the rabbit was 
bled. Prior to use in the Western blot assay, the 
antiserum was absorbed extensively with sonicates of the 
host E. coli strain transformed with cloning vector 
alone. 

10 To assess the sharing of antigenic determinants 

between HMW1 and filamentous hemagglutinin, enzyme-linked 
immunosorbent assay (ELISA) plates (Costar, Cambridge, 
Mass.) were coated with 60 m! of a 4-ug/ml solution of 
filamentous hemagglutinin in Dulbecco's phosphate- 

15 buffered saline per well for 2 h at room temperature. 

Wells were blocked for 1 h with 1% bovine serum albumin 
in Dulbecco's phosphate-buffered saline prior to addition 
of serum dilutions. rHMWl antiserum was serially diluted 
in 0.1% Brij (Sigma, St. Louis, Mo.) in Dulbecco's 

20 phosphate-buffered saline and incubated for 3 h at room 
temperature. After being washed, the plates were 
incubated with peroxidase-conjugated goat anti-rabbit lgG 
antibody (Bio-Rad) for 2 h at room temperature and subse- 
quently developed with 2 , 2 ' -az ino-bis ( 3 - 

25 ethylbenzthiazoline-6-sulfonic acid) (Sigma) at a 
concentration of 0.54 in mg/ml in 0.1 M sodium citrate 
buffer, pH 4.2, containing 0.03% H 2 0 2 . Absorbances were 
read on an automated ELISA reader. 

Recombinant phage expressing HMW1 or HMW2 were 

3 0 recovered as follows. The non-typeable H. influenzae 
strain 12 genomic library was screened for clones 
expressing high -molecular-weight proteins with an E. 
coli-absorbed human serum sample containing a high titer 
of antibodies directed against the high-molecular-weight 
3 5 proteins . 
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Numerous strongly reactive clones were identified 
along with more weakly reactive ones. Twenty strongly 
reactive clones were plaque-purified and examined by 
Western blot for expression of recombinant proteins. 
5 Each of the strongly reactive clones expressed one of two 

types of high-molecular-weight proteins, designated HMW1 
and HMW2. The major immunoreactive protein bands in the 
HMW1 and HMW2 lysates migrated with apparent molecular 
masses of 125 and 120 kDa, respectively. In addition to 

10 the major bands, each lysate contained minor protein 

bands of higher apparent molecular weight. Protein bands 
seen in the HMW2 lysates at molecular masses of less than 
12 0 kDa were not regularly observed and presumably 
represent proteolytic degradation products. Lysates of 

15 LE3 92 infected with the XEMBL3 cloning vector alone were 
non-reactive when immunologically screened with the same 
serum sample. Thus, the observed activity was not due to 
cross-reactive E. coli proteins or XEMBL3 -encoded pro- 
teins. Furthermore, the recombinant proteins were not 

20 simply binding immunoglobulin nonspecif ically , since the 

proteins were not reactive with the goat anti-human IgG 
conjugate alone, with normal rabbit sera, or with serum 
from a number of healthy young infants. 

Representative clones expressing either the HMW1 or 

25 HMW2 recombinant proteins were characterized further. 

The restriction maps of the two phage types were 
different from each other, including the regions encoding 
the HMWl and HMW2 structural genes. Figure 5A shows 
restriction maps of representative recombinant phage 

3 0 which contained the HMWl or HMW2 structural genes. The 
locations of the structural genes are indicated by the 
shaded bars. 

HMWl plasmid subclones were constructed by using the 
T7 expression plasmid T7-7 (Fig. 5A and B) . HMW2 plasmid 
3 5 subclones also were constructed, and the results with 
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these latter subclones were similar to those observed 
with the HMW1 constructs. 

The approximate location and direction of 
transcription of the HMW1 structure gene were initially 
5 determined by using plasmid pHMWl (Fig. 5A) . This 

plasmid was constructed by inserting the 8.5-kb BamHI- 
Sai l fragment from XHMW1 into BamHI- and Sail-cut pT7-7 . 
E. coli transformed with pHMWl expressed an 
immunoreactive recombinant protein with an apparent 

10 molecular mass of 115 kDa, which was strongly inducible 
with XPTG. This protein was significantly smaller than 
the 125-kDa major protein expressed by the parent phage, 
indicating that it either was being expressed as a fusion 
protein or was truncated at the carboxy terminus. 

15 To more precisely localize the 3 ' end of the 

structural gene, additional plasmids were constructed 
with progressive deletions from the 3' end of the pHMWl 
construct. Plasmid pHMWl-1 was constructed by digestion 
of pHMWl with PstI, isolation of the resulting 8.8-kb 

20 fragment, and religation. Plasmid pHMWl-2 was 

constructed by digestion of pHMWl with Hindlll, isolation 
of the resulting 7.5-kb fragment, and religation. E. 
coli transformed with either plasmid pHMWl-1 or pHMWl-2 
also expressed an immunoreactive recombinant protein with 

25 an apparent molecular mass of 115 kDa. These results 

indicated that the 3 ' end of the structural gene was 5 ' 
of the Hindlll site. 

To more precisely localize the 5' end of the gene, 
plasmids pHMWl-4 and pHMWl-7 were constructed. Plasmid 

3 0 pHMWl-4 was constructed by cloning the 5.1-kb BamHI- 

Hind lXI fragment from XHMW1 into a pT7 -7 -derived plasmid 
containing the upstream 3.8-kb EcoRI - BamH i fragment. E. 
coli transformed with pHMWl-4 expressed an immunoreactive 
protein with an apparent molecular mass of approximately 
3 5 160 kDa. Although protein production was inducible with 

IPTG, the levels of protein production in these 
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transformants were substantially lower than those with 
the pHMWl-2 transformants described above. Plasmid 
pHMWl-7 was constructed by digesting pHMWl-4 with Nde l 
and Seel. The 9 . 0-kbp fragment generated by this double 

* 5 digestion was isolated, blunt ended, and religated. E. 

coli transformed with pHMWl-7 also expressed an 

* immunoreactive protein with an apparent molecular mass of 

160 )cDa, a protein identical in size to that expressed by 
the pHMWl-4 transformants. The result indicated that the 

10 initiation codon for the HMW1 structural gene was 3' of 

the Spel site. DNA sequence analysis confirmed this 
conclusion. 

As noted above, the XHMW1 phage clones expressed a 
major immunoreactive band of 125 kDa, whereas the HMW1 

15 plasmid clones pHMWl-4 and pHMWl-7 , which contained what 
was believed to be the full-length gene, expressed an 
immunoreactive protein of approximately 160 kDa. This 
size discrepancy was disconcerting. One possible 
explanation was that an additional gene or genes 

20 necessary for correct processing of the HMW1 gene product 
were deleted in the process of subcloning. To address 
this possibility, plasmid pHMWl-14 was constructed. This 
construct was generated by digesting pHMWl with Nde l and 
Mlul and inserting the 7.6-kbp Ndel-Mlul fragment 

25 isolated from pHMWl-4. Such a construct would contain 

the full-length HMW1 gene as well as the DNA 3 ' of the 
HMW1 gene which was present in the original HMW1 phage. 
E. coli transformed with this plasmid expressed major 
immunoreactive proteins with apparent molecular masses of 

3 0 125 and 160 kDa as well as additional degradation 

products. The 125- and 160-kDa bands were identical to 
the major and minor immunoreactive bands detected in the 
HMW1 phage lysates. Interestingly, the pHMWl-14 

construct also expressed significant amounts of protein 

3 5 in the uninduced condition, a situation not observed with 

the earlier constructs. 
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The relationship between the 125* and 160-kDa 
proteins remains somewhat unclear. Sequence analysis, 
described below, reveals that the HMW1 gene would be 
predicted to encode a protein of 159 kDa. It is believed 
5 that the 160-kDa protein is a precursor form of the 
mature 125-kDa protein, with the conversion from one 
protein to the other being dependent on the products of 
the two downstream genes. 

Sequence analysis of the HMW1 gene (Figure 1) 

10 revealed a 4,608-bp open reading frame (ORF) , beginning 

with an ATG codon at nucleotide 3 51 and ending with a TAG 
stop codon at nucleotide 4959. A putative ribosome- 
binding site with the sequence AGGAG begins 10 bp up- 
stream of the putative initiation codon. Five other in- 

15 frame ATG codons are located within 250 bp of the 
beginning of the ORF, but none of these is preceded by a 
typical ribosome-binding site. The 5 '-flanking region of 
the ORF contains a series of direct tandem repeats, with 
the 7-bp sequence ATCTTTC repeated 16 times. These 

2 0 tandem repeats stop 100 bp 5' of the putative initiation 

codon. An 8 -bp inverted repeat characteristic of a rho- 
independent transcriptional terminator is present, 
beginning at nucleotide 4983, 25 bp 3' of the presumed 
translational stop. Multiple termination codons are 
25 present in all three reading frames both upstream and 
downstream of the ORF. The derived amino acid sequence 
of the protein encoded by the HMW1 gene (Figure 2) has a 
molecular weight of 159,000, in good agreement with the 
apparent molecular weights of the proteins expressed by 

3 0 the HMW1-4 and HMW1-7 transf ormants . The derived amino 

acid sequence of the amino terminus does not demonstrate 
the characteristics of a typical signal sequence. The 
BamHI site used in generation of pHMWl comprises bp 1743 
through 1748 of the nucleotide sequence. The ORF 
3 5 downstream of the BamH I site would be predicted to encode 

a protein of 111 kDa, in good agreement with the 115 kDa 



BNSDOCIO. <WO 931909CA1J_^ 



SUBSTITUTE SHEET 



WO 93/19090 



PCI7US93/02166 



15 

estimated for the apparent molecular mass of the pHMWl- 
encoded fusion protein. 

The sequence of the HMW2 gene (Figure 3) consists of 
a 4,4 31-bp ORF, beginning with an ATG codon at nucleotide 
5 352 and ending with a TAG stop codon at nucleotide 4783. 

The first 1,259 bp of the ORF of the HMW2 gene are 
identical to those of the HMW1 gene. Thereafter, the 
sequences begin to diverge but are 80% identical overall. 
With the exception of a single base addition at 

10 nucleotide 93 of the HMW2 sequence, the 5 '-flanking 
regions of the HMW1 and HMW2 genes are identical for 310 
bp upstream from the respective initiation codons. Thus, 
the HMW2 gene is preceded by the same set of tandem 
repeats and the same putative ribosome-binding site which 

15 lies 5' of the HMW1 gene. A putative transcriptional 

terminator identical to that identified 3' of the HMWl 
ORF is noted, beginning at nucleotide 4804. The 
discrepancy in the lengths of the two genes. is 
principally accounted for by a 186-bp gap in the HMW2 

20 sequence, beginning at nucleotide position 3839. The 
derived amino acid sequence of the protein encoded by the 
HMW2 gene (Figure 4) has a molecular weight of 155,000 
and is 71% identical with the derived amino acid sequence 
of the HMWl gene. 

2 5 The derived amino acid sequences of both the HMWl 

and HMW2 genes (Figures 2 and 4) demonstrated sequence 
similarity with the derived amino acid sequence of 
filamentous hemagglutinin of Bordetella pertussis . a 
surface-associated protein of this organism. The initial 

3 0 and optimized TFASTA scores for the HMWl-f ilamentous 

hemagglutinin sequence comparison were 87 and 186, 
respectively, with a word size of 2. The z score for the 
comparison was 4 5.8. The initial and optimized TFASTA 
scores for the HMW2-f ilamentous hemagglutinin sequence 
35 comparison were 68 and 196, respectively. The z score 
for the latter comparison was 48.7. The magnitudes of 
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the initial and optimized TFASTA scores and the z scores 
suggested that a biologically significant relationship 
existed between the HMW1 and HMW2 gene products and 
filamentous hemagglutinin* When the derived amino acid 
5 sequences of HMW1, HMW2 , and filamentous hemagglutinin 

genes were aligned and compared, the similarities were 
most notable at the amino-terminal ends of the three 
sequences* Twelve of the first 22 amino acids in the 
predicted peptide sequences were identical. In 
10 additional, the sequences demonstrated a common five- 

amino-acid stretch, Asn-Pro-Asn-Gly-Ile, and several 
shorter stretches of sequence identity within the first 
200 amino acids* 
Example 2 : 

15 To further explore the HMW1- filamentous 

hemagglutinin relationship, the ability of antiserum 
prepared against the HMW1-4 recombinant protein (rHMWl) 
to recognize purified filamentous hemagglutinin was 
assessed. The rHMWl antiserum demonstrated ELISA 
20 reactivity with filamentous hemagglutinin in a dose- 

dependent manner. Preimmune rabbit serum had minimal 
reactivity in this assay. The rHMWl antiserum also was 
examined in a Western blot assay and demonstrated weak 
but positive reactivity with purified filamentous 
25 hemagglutinin in this system also. 

To identify the native Haemophilus protein 
corresponding to the HMW1 gene product and to determine 
the extent to which proteins antigenically related to the 
HMW1 cloned gene product were common among other non- 
30 typeable H. influenzae strains, a panel of Haemophilus 
strains was screened by Western blot with the rHMWl 
antiserum. Th . antiserum recognized both a 125- and a 
120-kDa protei : band in the homologous strain 12, the 
putative mature protein products of the HMW1 and HMW2 
3 5 genes, respectively* 
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When used to screen heterologous non-typeable H. 
influenzae strains, rHMWl antiserum recognized high- 
molecular-weight proteins in 75% of 125 epidemiologically 
unrelated strains* In general, the antiserum reacted 
5 with one or two protein bands in the 100- to 150-kDa 
range in each of the heterologous strains in a pattern 
similar but not identical to that seen in the homologous 
strain. 

Monoclonal antibody X3C is a murine IgG antibody 

10 directed against the filamentous hemagglutinin protein of 
B. pertussis. This antibody can inhibit the binding of 
B. per tussis cells to Chinese hamster ovary cells and 
HeLa cells in culture and will inhibit hemagglutination 
of erythrocytes by purified filamentous hemagglutinin. 

15 A Western blot assay was performed in which this 
monoclonal antibody was screened against the same panel 
of non-typeable H. influenzae strains discussed above. 
Monoclonal antibody X3C recognized both the high- 
molecular-weight proteins in non-typeable H. influenzae 

20 strain 12 which were recognized by the recombinant - 
protein antiserum. In addition, the monoclonal antibody 
recognized protein bands in a subset of heterologous non- 
typeable H. influenzae strains which were identical to 
those recognized by the recombinant -protein antiserum. 

25 On occasion, the filamentous hemagglutinin monoclonal 
antibody appeared to recognize only one of the two bands 
which had been recognized by the recombinant-protein 
antiserum. Overall, monoclonal antibody X3C recognized 
high-molecular-weight protein bands identical to those 

3 0 recognized by the rHMWl antiserum in approximately 3 5% of 
our collection of non-typeable H. influenzae strains. 
Example 3 : 

Mutants deficient in expression of HMW1, MW2 or both 
proteins were constructed to examine the role of these 
3 5 proteins in bacterial adherence. The following strategy 

was employed. pHMWl-14 (see Example 1, Figure 5A) was 
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digested with BamHI and then ligated to a kanamycin 
cassette isolated on a 1.3-kb BamHI fragment from pUC4K. 
The resultant plasmid (pHMWl-17) was linearized by 
digestion with Xba l and transformed into non-typeable *L- 
5 influenzae strain 12 , followed by selection for kanamycin 

resistant colonies. Southern analysis of a series of 
these colonies demonstrated two populations of 
transformants, one with an insertion in the HMW1 
structural gene and the other with an insertion in the 
10 HMW2 structural gene. One mutant from each of these 
classes was selected for further studies. 

Mutants deficient in expression of both proteins 
were recovered using the following protocol. After 
deletion of the 2.1-kb fragment of DNA between two EcoRI 
15 sites spanning the 3' -portion of the HMW1 structural gene 

in pHMW-15, the kanamycin cassette from pUC4K was 
inserted as a 1.3-kb EcoRI fragment. The resulting 
plasmid (pHMWl-16) was linearized by digestion with Xbal 
and transformed into strain 12, followed again by 
20 selection for kanamycin resistant colonies. Southern 
analysis of a representative sampling of these colonies 
demonstrated that in seven of eight cases, insertion into 
both the HMW1 and HMW2 loci had occurred. One such 
mutant was selected for further studies. 
25 To confirm the intended pheno types , the mutant 

strains were examined by Western blot analysis with a 
polyclonal antiserum against recombinant HMW1 protein. 
The parental strain expressed both the 125-kD HMW1 and 
the 120-kD HMW2 protein. In contrast, the HMW2* mutant 
3 0 failed to express the 120-kD protein, and the HMW1 mutant 

failed to express the 125-kD protein. The double mutant 
lacked expression of either protein. On the basis of 
whole cell lysates, outer membrane profiles, and colony 
morphology, the wild type strain and the mutants were 
35 otherwise identical with one another. Transmission 
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electron microscopy demonstrated that none of the four 
strains expressed pili. 

The capacity of wild type strain 12 to adhere to 
Chang epithelial cells was examined. In such assays, 
5 bacteria were inoculated into broth and allowed to grow 
to a density of -2 x 10 9 cfu/ml. Approximately 2 x 10 7 
cfu were inoculated onto epithelial cell monolayers, and 
plates were gently centrifuged at 165 x g for 5 minutes 
to facilitate contact between bacteria and the epithelial 

10 surface. After incubation for 30 minutes at 37 °C in 5% 
C0 2 , monolayers were rinsed 5 times with PBS to remove 
nonadherent organisms and were treated with trypsin-EDTA 
(0.05% trypsin, 0.5% EDTA) in PBS to release them from 
the plastic support. Well contents were agitated, and 

15 dilutions were plated on solid medium to yield the number 
of adherent bacteria per monolayer. Percent adherence 
was calculated by dividing the number of adherent cfu per 
monolayer by the number of inoculated cfu. 

As depicted in Table 1 below (the Tables appear at 

20 the end of the descriptive text) , this strain adhered 
quite efficiently, with nearly 90% of the inoculum 
binding to the monolayer. Adherence by the mutant 
expressing HMW1 but not HMW2 (HMW2") was also quite 
efficient and comparable to that by the wild type strain. 

2 5 In contrast, attachment by the strain expressing HMW2 but 

deficient in expression of HMWl (HMWl") was decreased 
about 15-fold relative to the wild type. Adherence by 
the double mutant (HMW1/HMW2) was decreased even 
further, approximately 50-fold compared with the wild 

3 0 type and approximately 3 -fold compared with the HMWl 

mutant. Considered together, these results suggest that 
both the HMWl protein and the, HMW2 protein influence 
attachment to Chang epithelial cells. Interestingly, 
optimal adherence to this cell line appears to require 
3 5 HMWl but not HMW2 . 
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Example 4 : 

Using the plasmids pHMWl-16 and pHMWl-17 (see 
Example 3) and following a scheme similar to that 
employed with strain 12 as described in Example 3, three 
5 non-typeable Haemophilus strain 5 mutants were isolated, 
including one with the kanamycin gene inserted into the 
hmwl-like (designated hmwl) locus , a second with an 
insertion in the hmw2-like (designated hmw4) locus, and 
a third with insertions in both loci . As predicted , 
10 Western immunoblot analysis demonstrated that the mutant 
with insertion of the kanamycin cassette into the hmwi- 
like locus had lost expression of the HMW3 125-kD 
protein, while the mutant with insertion into the hmw2- 
like locus failed to express the HMW4 123-kD protein, 
15 The mutant with a double insertion was unable to express 
either of the high molecular weight proteins. 

As shown in Table 1 below, wild type strain 5 
demonstrated high level adherence, with almost 80% of the 
inoculum adhering per monolayer. Adherence by the mutant 
20 deficient in expression of the HMW2-like protein was also 
quite high. In contrast, adherence by the mutant unable 
to express the, HMWl-like protein was reduced about 5- 
fold relative to the wild type, and attachment by the 
double mutant was diminished even further (approximately 
25 25-fold) . Examination of Giemsa-stained samples 

confirmed these observations (not shown) . Thus, the 
results with strain 5 corroborate the findings with 
strain 12 and the HMW1 and HMW2 proteins. 
Example 5 : 

30 To confirm an adherence function for the HMW1 and 

HMW2 proteins and to examine the effect of HMW1 and HMW2 
independently of other H. influenzae surface structures, 
the hmwl and the hmw2 gene clusters were introduced into 
E. coli DH5a, using plasmids pHMWl-14 and pHMW2-21, 

3 5 respectively. As a control, the cloning vector, pT7-7, 
was also transformed into E. coli DH5a. Western blot 
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analysis demonstrated that E. coli DH5a containing the 
hmwl genes expressed a 125 kDa protein, while the same 
strain harboring the hmw2 genes expressed a 12 0-kDa 
protein. E. coli DH5a containing pT7-7 failed to react 
5 with antiserum against recombinant HMWl. Transmission 
electron microscopy revealed no pili or other surface 
appendages on any of the E. coli strains* 

Adherence by the E. coli strains was quantitated and 
compared with adherence by wild type non-typeable 

10 influenzae strain 12. As shown in Table 2 below, 

adherence by E. coli DH5a containing vector alone was 
less than 1% of that for strain 12. In contrast, E. coli 
DH5a harboring the hmwl gene cluster demonstrated 
adherence levels comparable to those for strain 12. 

15 Adherence by E. coli DH5oc containing the hmw2 genes was 
approximately 6-fold lower than attachment by strain 12 
but was increased 20-fold over adherence by E. coli DH5a 
with pT7-7 alone. These results indicate that the HMW1 
and HMW2 proteins are capable of independently mediating 

20 attachment to Chang conjunctival cells. These results 
are consistent with the results with the H. influenzae 
mutants reported in Examples 3 and 4, providing further 
evidence that, with Chang epithelial cells, HMW1 is a 
more efficient adhesin than is HMW2 . 

25 Experiments with E. coli HB101 harboring pT7-7, 

pHMWl-14, or pHMW2-21 confirmed the results obtained with 
the DH5a derivatives (see Table 2) . 
Example 6 : 

HMW1 and HMW2 were isolated and purified from non- 
30 typeable H. influenzae (NTHI) strain 12 in the following 
manner. Non-typeable Haemophilus bacteria from frozen 
stock culture were streaked onto a chocolate plate and 
grown overnight at 37 °C in an incubator with 5% C0 2 . 
50ml starter culture of brain heart infusion (BHI) broth, 
3 5 supplemented with 10 jig/ml each of hemin and NAD was 

inoculated with growth on chocolate plate. The start r 
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culture was grown until the optical density (O.D. - 
600nm) reached 0.6 to 0.8 and then the bacteria in the 
starter culture was used to inoculate six 500 ml flasks 
of supplemented BHI using 8 to 10 ml per flask. The 
bacteria were grown in 500 ml flasks for an additional 5 
to 6 hours at which time the O.D. was 1.5 or greater, 
cultures were centrifuged at 10,000 rpm for 10 minutes. 

Bacterial pellets were resuspended in a total volume 
of 250 ml of ah extraction solution comprising 0.5 M 
NaCI, 0.01 M Na 2 EDTA, 0.01 M Tris 50 fM. 1,10- 
phenanthroline, pH 7.5. The cells were not sonicated or 
otherwise disrupted. The resuspended cells were allowed 
to sit on ice at 0°C for 60 minutes. The resuspended 
cells were centrifuged at 10,000 rpm for 10 minutes at 
4°C to remove the majority of intact cells and cellular 
debris. The supernatant was collected and centrifuged at 
100,000 xg for 60 minutes at 4°C. The supernatant again 
was collected and dialyzed overnight at 4°C against 0.01 
M sodium phosphate, pH 6.0. 

The sample was centrifuged at 10,000 rpm for 10 
minutes at 4°C to remove insoluble debris precipitated 
from solution during dialysis. The supernatant was 
applied to a 10 ml CM Sepharose column which has been 
pre-equilibrated with 0.01 M sodium phosphate, pH 6. 
Following application to this column, the column was 
washed with 0.01 M sodium phosphate. Proteins were 
elevated from the column with a 0 - 0.5M KC1 gradient in 
0.01 M Na phosphate, pH 6 and fractions were collected 
for gel examination. Coomassie gels of column fractions 
were carried out to identify those fractions containing 
high molecular weight proteins . The fractions containing 
high molecular weight proteins were pooled and 
concentrated to a 1 to 3 ml volume in preparation for 
application of sample to gel filtration column. 

A Sepharose CL-4B gel filtration column was 
equilibrated with phosphate-buffered saline, pH 7.5. The 
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concentrated high molecular weight protein sample was 
applied to the gel filtration column and column fractions 
were collected. Coomassie gels were performed on the 
column fractions to identify those containing high 
5 molecular weight proteins. The column fractions 

containing high molecular weight proteins were pooled. 

The proteins were tested to determine whether they 
would protect against experimental otitis media caused by 
the homologous strain. 

10 Chinchillas received three monthly subcutaneous 

injections with 40 /ig of an HMW1-HMW2 protein mixture in 
Freund's adjuvant. One month after the last injection, 
the animals were challenged by intrabullar inoculation 
with 3 00 cfu of NTHI strain 12. 

15 Infection developed in 5 of 5 control animals versus 

5 of 10 immunized animals. Among infected animals, 
geometric mean bacterial counts in middle ear fluid 7 
days post-challenge were 7.4 x 10 6 in control animals 
verus 1.3 x 10 5 in immunized animals. 

20 Serum antibody titres following immunization were 

comparable in uninfected and infected animals. However, 
infection in immunized animals was uniformly associated 
with the appearance of bacteria down-regulated in 
expression of the HMW proteins, suggesting bacterial 

25 selection in response to immunologic pressure. 

Although this data shows that protection following 
immunization was not complete, this data suggests the HMW 
adhesin proteins are potentially important protective 
antigens which may comprise one component of a multi- 

30 component NTHI vaccine. 
Example 7 : 

A number of synthetic peptides were derived from 
HMWl. Antisera then was raised to these peptides. The 
anti-peptide antisera to peptide HMW1-P5 was shown to 
35 recognize HMWl. Peptide HMW1-P5 covers amino acids 1453 

to 1481 of HMWl, has the sequence 
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VDEVIEAKRILEKVKDLSDEEREALAKLG (SEQ ID NO: 9) , and 
represents bases 1498 to 1576 in Figure 10. 

This finding demonstrates that the DNA sequence and 
the derived protein is being interpreted in the correct 
5 reading frame and that peptides derived from the sequence 

can be produced which will be immunogenic. 

SUMMARY OF DISCLOSURE 
In summary of this disclosure, the present invention 
provides high molecular weight proteins of non-typeable 
10 Haemophilus , genes coding for the same and vaccines 
incorporating such proteins. Modifications are possible 
within the scope of this invention. 
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Table 1. Effect of mutation of high molecular weight 
proteins on adherence to Chang epithelial cells by 
nontypable H. influenzae. 



Strain 

Strain 12 derivatives 
wild type 
HMWl* mutant 
HMW2- mutant 
HMW17HMW2- mutant 



ADHERENCE* 
^ inoculum relative to wild tvnef 



87.7 + 5.9 
6.0 ± 0.9 
89.9 ± 10.8 
2.0 + 0.3 



100.0 + 6.7 
6.8 + 1.0 
102.5 + 12.3 
2.3 ± 0.3 



Strain 5 derivatives 
wild type 

HMWl -like mutant 
HMW2 -like mutant 
double mutant 



78.7 ± 3.2 
15.7 ± 2.6 
103.7 + 14.0 
3.5 + 0.6 



100.0 + 4.1 
19.9 + 3.3 
131.7 + 17.8 
4.4 + 0.8 



* Numbers represent mean (+. standard error of the mean) of 
measurements in triplicate or quadruplicate from representative 
experiments. 

T Adherence values for strain 12 derivatives are relative to strain 12 
wild type; values for strain 5 derivatives are relative to strain 5 wild 
type. 
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Table 2. Adherence by E. coli DH5a and HB101 harboring 
hmwl or hmw2 gene clusters. 



Strain* 

DH5a (pT7-7) 
DH5a (pHMWl-14) 
DH5a (pHMW2-21) 



Adherence relative to 

ZL influenzae strain 12t 

0.7 + 0.02 
114.2 + 15.9 
14.0 ± 3.7 



HB101 (pT7-7) 1.2+0.5 
HB101 (pHMWl-14) 93.6 + 15.8 

HB101 CpHMW2-21) 3.6 + 0.9 



The piasmid pHMWl-14 contains the hmwl gene cluster, while 
pHMW2-21 contains the hmwl gene cluster; pT7-7 is the cloning 
vector used in these constructs. 

f Numbers represent the mean (+. standard error of the mean) of 
measurements made in triplicate from representative experiments. 
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CLAIMS 

What I claim is: 

1. An isolated and purified gene encoding a high 
molecular weight protein of a non-typeable Haemophilus 
strain. 

2. The gene of claim 1 encoding protein HMWl, HMW2, 
HMW3 or HMW4 or a variant or fragment of said protein 
retaining the immunological ability to protect against 
disease caused by a non-typeable Haemophilus strain. 

3. The gene of claim 2 having the DNA sequence shown in 
Figure 1 and encoding protein HMWl having the derived 
amino acid sequence of Figure 2. 

4. The gene of claim 2 having the DNA sequence shown in 
Figure 3 and encoding protein HMW2 having the derived 
amino acid sequence of Figure 4 . 

5. The gene claimed in claim 2 having the partial DNA 
sequence shown in Figure 8 and encoding protein HMW3 
having the derived amino acid sequence of Figure 10. 

6. The gene claimed in claim 2 having the partial DNA 
sequence shown in Figure 9 and encoding protein HMW4 
having the derived amino acid sequence of Figure 10 . 

7. A purified and isolated gene cluster comprising a 
nucleotide sequence for a structural gene encoding a high 
molecular weight protein of a non-typeable Haemophilus 
strain and at least one downstream nucleotide sequence 
for an accessory gene for effecting expression of a gene 
product fully encoded by said structural gene. 

8. The gene cluster claimed in claim 7 comprising a DNA 
sequence coding for protein HMWl or HMW2 and two 
downstream accessory genes . 

9. The gene cluster of claim 8 having the DNA sequence 
shown in Figure 6. 

10. The gene cluster of claim 8 having the DNA sequence 
shown in Figure 7 . 

11. A high molecular weight protein of non-typeable 
Haemophilus which is encoded by a gene as defined in 
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claim 1, or any variant or fragment thereof retaining the 
immunological ability to protect against disease caused 
by a non-typeable Haemophilus strain. 

12. The protein of claim 11 which is HMW1 encoded by the 
DNA sequence shown in Figure 1, having the derived amino 
acid sequence of Figure 2 and having an apparent 
molecular weight of 125 kDa. 

13. The protein claim 11 which is HMW2 encoded by the 
DNA sequence shown in Figure 3 and having the derived 
amino acid sequence of Figure 4 and having an apparent 
molecular weight of 120 kDa. 

14 . An isolated and purified high molecular weight 
protein of non-typeable Haemophilus influenzae which is 
antigenically related to the filamentous hemagglutinin 
surface protein of Bordetella pertussis . 

15. The protein of claim 14 which is HMW1, HMW2, HMW3 or 
HMW4 . 

16. A conjugate comprising a protein as claimed in claim 
11 or 14 linked to a antigen, hapten or polysaccharide 
for eliciting an immune response to said antigen, hapten 
or polysaccharide. 

17. The conjugate as claimed in claim 16 wherein said 
polysaccharide is a protective polysaccharide against 
Haemophilus influenzae type b. 

18 . A synthetic peptide having an amino acid sequence 
corresponding to at least one protective epitope of a 
high molecular weight protein of non-typeable Haemophilus 
influenzae . 

19. The peptide of claim 18 wherein said protein is 
HMWl, HMW2, HMW3 or HMW4 . 
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